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ABSTRACT 

> 

\^ • We revisit the longstanding question of whether first brightest cluster galaxies are 

Q>^ \ statistically drawn from the same distribution as other cluster galaxies or are "special" , 

, using the new non-parametric, empirically based, model presented in I Vale fc Ostrikerl 

(|2006t) for associating galaxy luminosity with halo/subhalo masses. 
\ We introduce scatter in galaxy luminosity at fixed halo mass into this model, 

■ building a conditional luminosity function (CLF) by considering two possible models: a 
' simple lognormal and a model based on the distribution of concentration in haloes of a 

given mass. We show that this model naturally allows an identification of halo/subhalo 
^ i' systems with groups and clusters of galaxies, giving rise to a clear central/satellite 

galaxy distinction, obtaining a special distribution for the brightest cluster galaxies 
(BCGs). 

Finally, we use these results to build up the dependence of BCG magnitudes 
^ \ on cluster luminosity, focusing on two statistical indicators, the dispersion in BCG 

magnitude and the magnitude difference between first and second brightest galaxies. 
We compare our results with two simple models for BCGs: a statistical hypothesis 
that the BCGs are drawn from a universal distribution, and a cannibalism scenario 

■ merging two galaxies fror n this distribution. The statis tical model is known to fail 
' from work as far back as iTremaine fc Richstond (|1977[ ) . We show that neither the 

statistical model nor the simplest possibility of cannibalism provide a good match 
for observations, while a more realistic cannibalism scenario works better. Our CLF 
models both give similar results, in good agreement with observations. Specifically, 
we find < TOi > between -25 and -25.5 in the K-band, <j{mi) ^ 0.25 and < A12 > 
between 0.6 and 0.8, for cluster luminosities in the range of 10^^ to lO^^h^'^ljQ. 

Key words: galaxies: haloes - galaxies: fundamental parameters - galaxies: clusters: 
general - dark matter - methods: statistical 
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1 INTRODUCTION 

The nature of brightest cluster galaxies (B CGs) has long 
been a subject of interest and much debate (|Peeblesl 1 19681 : 
ISandad Il972l : iDressleij Il978l ). In particular, investigators 
have asked whether their origin is statistical or special in 
nature, that is, whether they follow a special distribution 
independent of the fainter galaxies in the cluster, or on the 
contrary, they are merely the extreme values of the same 
global distribution derived for all cluster galaxies. 
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On the theoretical side, there has been renewed in- 
terest in this subject with recent studies of the relation 
between galaxies and their dark matter haloes from a 
theoretical, statistical point of view, involving the study 
of the distribution of the galaxy population through dif- 
ferent haloes while bypassing the co mplications of the 
physics of galaxy formatio n (e.g., Berlind fc Weinberg 200^: 



Vale fc Qstriker 2004; Tasi tsiomi et all |2004|; lYang et all 
20051: IZehavFet al.i i2005i: iZheng et all l2005l: [Cooravl I2OO6I: 



Conrov. Wechsler fc Kravtsovl 20061 : Ivan den Bosch et al.l 
20061 ). Since these involve populating dark matter haloes 
with galaxies, they usually lead to a distinction between 
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central and satellite galaxies. This in turn has lead to, in 
many of these works, central galaxies being treated sepa- 
rately from the rest, and therefore having a distinct dis- 
tribution, with consequences visible, for example, in the 
luminosity function. Some of these studies have in fact 
looked at some specific BCG-re lated properties of clus- 
ters, like the magnitude g ap fe.g.. iMilosavlievic et al.|[2006l : 
Ivan den Bosch et all [2009) . 

In the past, observ ational studies which have 
focused on this issue llXremaine fc Richstone 19771: 



Hoessel, Gunn & Thuan"l980"; 'Schneider, Gunn, & Hoessell 
1983; Bhavsar & Barrow 1985; Hocsscl & Schneider 19851; 
Bhavsaii 19891 : iPostman fc Laueill995l : lBernstein fc Bhavsarl 



20001 ) have been hindered by the limited numbers of high 
luminosity galaxy observations available, since the strongly 
declining nature of the bright end of the luminosity function 
requires having very large samples to obtain significant 
numbers of high luminosity galaxies. Due to this, these 
studies were mostly inconclusive when it came to answer 
the question of whether BCGs were statistical or special 
in nature, although many works hinted at the latter. 
More recently, the advent of large scale surveys such as 
the 2dF Galaxy Redshift Survey or the Sloan Digital 
Sky Survey (SDSS), has r notivated plentiful, ongoini 



work on th i s sub j e ct (e.g.. iLin. Mohr fc Stanford! 1200 



Un fc MohJ 12004 iLo h ^ 
2006: von der Linden et al 



Strauss! l20ol " 
200^ ). 



iBernardi et al.l 



This issue is in large part motivated by the fact that, 
observationally, BCGs do look different from other galaxies. 
They usually sit at the centre of the cluster, and tend to 
be considerably brighter than the remaining cluster mem- 
bers. The most striking case is cD galaxies, found in the 
centre of rich clusters and which dominate their satellites 
in both size and brightness, while having a characteristi- 
cally disti nct morphology and surface br i ghtne ss distribu- 
tion (e.g.. iBinggeli. Sandage fc TammannI Il988l ). Likewise, 
cD galaxies tend to be brighter than what would be ex- 
pected from the bright end of the cluster galaxy luminos- 
ity function. In fact, it has been observed that, when an- 
alyzing composite luminosity functions of cluster galaxies, 
the most luminous of them form a hump at the brigh t 
end (e.g., ICoUesd 1 19891 : lYagi et al.l I2OO2I : lEke et ahllioM ). 
Yet, at the sa me time, there is little variation in magnitude 
among them |Hoes scl fc Sc hneideJll985l: iPostman fc Lauen 
I1995I : IBernardi et^al. 2006 : von der Linden et al.ll2006l '). This 
ties in with the fact that the luminosity of BCGs is ex- 

J iected to vary only slowly with increasing cluster luminosity 
Lin fc M ohr 2004; see also the results for the mass luminos- 
ity relation of central galaxies in I Vale fc Ostrik"erll2006l ). 

In order to try to answer this problem from available 
observational data, two different indicators have been con- 
sidered. One is the shape of the overall distribution of the 
magnitude of BCGs. If BCGs are merely the extreme cases 
of a general distribution applicable to all cluster galaxies, 
then it is expected that results from extreme value the- 
ory in statistics apply, predicting a re sulting distribution 
shap e d like the Gumbel d istribution (iBhavsar fc Barrowl 
1 19851 : iBernstein fc Bhavsaii 2000). On the other hand, ff 
BCGs are considered a special, distinct type of galaxy, then 
some par ticular distribution is to be expected, such as a 
Gaussian l|Postman fc Laue"illl995l ) or lognormal. Some stud- 
ies have also raised the possibility that it could be actually 



a combina tion of the two, probably depending on th e type 
of cluster (|Bhavsaij|l9"89l : IBernstein fc Bhavsaiilioool ). 

The other property studied is the ratio r = Ai2/cri, 
where A12 is the average magnitude difference between the 
first and second brightest galaxies, and ai the dispersion in 
the magnitude of the first brig htest galaxy. It is possible to 
prove the powerful conclusion (|Tremaine fc Richstondfig??! ) 
that, if all galaxies are drawn from the same statistical dis- 
tribution, regardless of its exact form, then r < 1. Observa- 
tional results give a value f or r around 1.5 (e.g.. lLin fc Mohrl 
12004 iLoh fc Strauss|[200^ . which would exclude this possi- 
bility. 

This has led to the study of possible alternative 
scenarios for the formation of BCGs, in order to ac- 
count for their special nature. One such is galactic can- 
nibal ism, initially proposed by Ost riker and collabora- 
tors JOstriker fc Trem ainc 1975; Ostr iker fc Hausmanlll977l : 
iHausman fc Ostrikeiill978l ). Such a scenario is akin to tak- 
ing the above case of having all galaxies drawn from the 
same distribution, but then merging the brightest of them 
with one or more of the others. From this simplistic model 
of the process, it is easy to see that this mechanism would 
help to solve the above problem, mostly by increasing the 
value of A12 as the luminosity of the first brightest galaxy 
is driven up by the mergers and the brightness of the sur- 
viving second brightest galaxy declines as luminous galaxies 
are merged out of existence. 

In the present paper, we explore this issue in light 
of the non-parametric model for th e mas s luminosity rela- 
tion presen ted in [Va lc fc Ostrike^ (|2006l ) (hereafter paper 
I; see also IVale fc Ostrikeii i2004f ). The basic idea behind 
the non-parametric model is to adopt the simple proposi- 
tion that more luminous galaxies are hosted in more mas- 
sive haloes/subhaloes. No attempt at physical modelling is 
made and the association is made simply by matching one- 
to-one the rank ordered observational list of galaxies with 
the rank ordered computed list of haloes/subhaloes. We here 
extend this model by introducing scatter into it, and also by 
considering possible effects on the total disruption of some 
subhaloes into the total luminosity rel ated to the halo. 

As is the case in HOD models (e.g., | Berlind fc Weinberg! 
l2002l : IZehavi et ahlbOOSl : IZheng et al.!l2005h . this model nat- 
urally gives rise to a separation between central and satellite 
galaxies, by associating the former with the parent halo it- 
self and the latter with the subhaloes associated with it. We 
analyze this issue in more detail, studying how it affects the 
cluster galaxy luminosity function and gives rise to a bright 
end bump caused by the central galaxies. We then develop a 
model for the BCG luminosity distribution. Since the halo in 
fact arises from the union of subhaloes this non-parametric 
model is a statistically well defined variant of the cannibal- 



ism scenario. 



This paper is organised as follows: in section 2, we give a 
brief summary of the non-parametric model relating galaxy 
luminosity with halo/subhalo mass presented in paper I, and 
introduce a simple recipe for checking the contribution of de- 
stroyed subhaloes to the halo mass, and how this changes 
our estimate of the total luminosity. In section 3, we intro- 
duce scatter into the non-parametric model by building a 
conditional luminosity function, where we consider two pos- 
sibilities for it, either a simple lognormal shape or a better 
motivated approach involving the distribution of concentra- 
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tion for haloes of a given mass. In section 4, we explore 
more indepth how the model gives rise to a central/satellite 
galaxy separation, and show how this impacts the cluster 
galaxy luminosity function. In section 5, we build up a model 
for the distribution of cluster galaxies, based on the mass- 
luminosity relation and the halo/subhaloes separation which 
underpins it. In section 6, we present simple models to ac- 
count for another two possible origins for the BCG distri- 
bution: first, we consider that all cluster galaxies are drawn 
from the same distribution; then we take a simple model for 
cannibalism, by merging two of the galaxies (the brightest 
plus one other) in the first example. Finally, in section 7 we 
present the results of all models for the average magnitude 
of first and second brightest galaxies as well the dispersion 
of the former as a function of cluster luminosity. We then 
compare these results with observations. 

Throughout we have used a concordance cosmological 
model, with = .24, Qa = 0.76, h = 0.735 and as = 0.74 
jSpergel et al.ll2006l ). 



2 THE MASS-LUMINOSITY RELATION 

The work presented below is based on the non-parametric 
model for relating galaxy luminosity with halo/subhalo mass 
presented in paper I. The basic idea is that more massive 
haloes/subhaloes have deeper potential wells and will thus 
accrete more gas and subsequently will have more luminous 
galaxies forming within them. In effect, we take the relation 
between galaxy luminosity and halo/subhalo mass to be one 
to one and monotonic. An additional extra ingredient is nec- 
essary to maintain this approximation in the framework of 
the model, since subhaloes lose mass to the parent halo after 
accretion due to tidal interactions. Alternatively put, a halo 
is not simply the sum of the identifiable subhaloes within 
it due to tidal stripping. Therefore, we need to account for 
the mass of the subhaloes not at present, but that which 
they had at the time of their merger into the parent. The 
relation between mass and luminosity is then obtained sta- 
tistically by matching the numbers of galaxies with the total 
number of hosts, that is, haloes plus subhaloes through their 
distributions. 

The halo a bundance is given by th e usual Sheth-Tormen 
mass function (|Sheth fc Tormenll 19991 ): 



rih 



{M)dM = a(i + ±-] ,/ 2 ^^expf - ^ 
with u = \fa-jj^--^ 



a = 0.707, A 



dM , (1) 
0.322 and q = 0.3; 



(z)CT(Af)' 

as usual, o{M) is the variance on the mass scale M, D[z) is 
the growth factor, and 5c is the linear threshold for spherical 
collapse, which in the case of a fiat universe is 5c = 1.686, 
with a small correction dependent on Q,m {5c = 1.673 for 
a„ = 0.24). 

Following the discussion in paper I, we will assume a 
very simple model for the subhalo mass distribution within 
the parent. In terms of their original, pre-accretion mass, we 
assume that the subhalo distribution is given by a simple 
Schechter function: 

N(m\M)dm = A(M){m/ l3My°'e^p{'-ra/ l3M)dm/ I3M , (2) 

where the cutoff parameter /3 = 0.5 serves to in- 
sure that no subhalo was larger than half the present 



mass of the parent (otherwise it would, by definition, 
be the parent). The slope a = 1.9 is set to the same 
value as is generally found for the present day sub- 



halo mass function in simulations (e.g.. Gao et al.l 


2004 


WeUer et al.l 20051: 


van den Bosch. Tormen & Giocolil 


2005 


Zentner et all 12005 


: IShaw et al.l 120061). The normalization 



A{M) = l//3[r(2 - a) - r(2 - q, 1)] is set so that the to- 
tal mass originally in subhaloes corresponds to the present 
day mass (where the integration is done to an upper limit 
of 0.5M). This approximation potentially ignores the prob- 
lem of total disruption of some of the merged subhaloes, as 
can occur for example in the case of major mergers, by as- 
suming that all of these subhaloes are still present and that 
therefore the total fraction of m ass originally in subhaloes 
IS one. In I Vale fc Ostriked l|2006l ). we showed that as long as 
this fraction is close to one, then the resulting mass lumi- 
nosity relation is similar, with both number of satellites in 
a halo and their total luminosity decreasing slightly. From 
the study of simulation results it is still not completely clear 
how to treat this complex issue, and no simple analytical 
models are available, so we explore a simple recipe to bet- 
ter account for this problem in the context of our model in 
section 12.11 

The galaxy distribution is given by the luminosity func- 
tion. This is given by the usual Schechter function fit: 

L\dL_ 



(j)obs{L)dL ^ (t),{ — \ expf-— J- 



(3) 



The values of the parameters will depend on the wave- 
band used. In this paper, we use mostly the K-band lu- 
minosity function from the 2MASS survey, with parame- 
ters given hj a = -1 09, (t>* = 1.16 x 10~ ^/t^Mpc~^ and 
M, - 51og/i = -23.39 iKochanek et al.ll200ll ). For compari- 
son, we also obtain the 6j-band 2dF survey, with a = —1.21, 
= 1.61 X 10~^/ t^Mpc-^ and M. - 51og/i = -19.66 
Norberg et al.ll200^ ). Also note that we are in fact extend- 
ing these fits as necessary, including beyond the magnitude 
interval in which they were obtained. 

The basic mass-luminosity relation can then be ob- 
tained from these ingredients by a counting process, match- 
ing the numbers of galaxies at a given luminosity to the total 
number of hosts at a given mass: 



4>{L)dL= / {nh{M) + nsh{M))dm , 



(4) 



where the host contribution is separated into a halo term, 
nh{M), and a subhalo term obtained by summing up all the 
subhaloes at that mass, Ush. (m) = N{m\M)nh{M)dM. 
An average relation between host mass and galaxy lumi- 
nosity can then be built through this process, with results 
that match well with observations (see paper I for a detailed 
analysis). The resulting relation can be well fit by a double 
power law of the type: 



LrefiM) = Lo 



[M/Mo) 



[1 + {M/MoY''Y/^ 



(5) 



where the differents parameters are shown in tabled mass is 
in units of /i~^M0, luminosity in /i~^Lq. The fit was done 
in the mass range 10^^ (3 x 10^" in the hj band case) to 
3 x lO^'^ h~^MQ. 

Figure [1] shows the results for the luminosity of a single 
galaxy as a function of the mass of the hosting halo/subhalo, 
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K-band 



fej-band 



Lo 

Mo 

a 

b 

k 



1.37 X 10"' 
6.14 X 10^ 
21.03 
20.74 
0.0363 



4.12 X lO'' 
1.66 X lO^o 

6.653 

6.373 

0.111 



Table 1. Fit parameters for the mass-luminosity relation. 



together with the corresponding mass-to-Ught ratio. Shown 
are curves for both K- and bj bands. We caution that the 
results for the latter band should be treated with some re- 
serve. This counting method is not entirely adequate to get 
the mass-luminosity relation in the blue, due to complica- 
tions arising from recent star formation, although it is still 
interesting to compare the differences obtained from using 
two different luminosity functions. 




M (h Ms 



Figure 1. Mass-luminosity relation as obtained using the non- 
parametric model, in the K- and bj— bands. Upper pannel shows 
galaxy luminosity normalized to the characteristic luminosity, L* , 
of each band; lower pannel shows the corresponding mass-to-light 
ratio. 



2.1 Destroyed subhaloes and the subhalo mass 
fraction 

As mentioned, a potentially important correction to the non- 
parametric model in paper I is to account for subhaloes 
which have been completely destroyed. The study of this 
evolution of the subhalos and their eventual destruction, 
with the subsequent merger (or not) of their galaxy with 
the central one, is a very interesting topic by itself, which is 
still not completely understood but which is essential to a 
complete understanding of the formation of BCGs. However, 
such a detailed look at this question is beyond the scope of 
the present paper; here, we are merely interested in a simple 
model to account for how much mass was in these destroyed 
subhalos, to correct the normalization of our original sub- 
halo mass function. 

Our scheme is based on the fact that most of the lu- 
minosity of the central galaxy is built up by merging with 
the satellite galaxies brought in by these subhaloes. In other 
words, the central (BCG) optical galaxy is made up of the 
galaxies that have been "merged away" - disappeared from 
the original distributions. This is consistent with what is 
known of the size, shape and colour properties of central 
galaxies. We therefore assume that the fraction of mass in 
these destroyed subhalos (with respect to the total halo 
mass) , is given by the ratio of the central galaxy luminosity 
to the total luminosity of the halo: 



fdest — 



ITldest 
M 



Ltotal 



(6) 



For a given L{M) relation, which sets the luminosity of 
both the central and satellite galaxies as a function of the 
halo/subhalo mass, the previous equation can then be solved 
for fdest as a function of halo mass, since the total lumi- 
nosity is going to be a function of only it and the total 
mass; Ltotai = Lcent{M) -1- (1 - fdest)Lsat,max{M), where 
Lsat,max is the maximum contribution of the satellites for 
when fdest ~ 0. This is given by: 



-^sat^max 



(M) 



Lrefim)N{ni\M)dm. 



(7) 



The upper pannel of figure [5] shows the destroyed mass 
fraction as a function of halo mass for our base mass- 
luminosity relation, given by equation ([5]), while the bottom 
pannel shows the effect on the total luminosity. As can be 
seen, this is most pronounced at the lower end of the mass 
scale shown, and becomes small enough to have little effect 
at high mass. 

There are two additional factors that need to be noted. 
First, the introduction of this term can also have an effect 
on the actual mass-luminosity relation, since we are using a 
counting method to obtain it. However, for the mass range 
we are interested in, the number counts are dominated by 
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Figure 2. Upper panncl: total mass in subhaloes which have 
been completely disrupted, as a fraction of the total halo mass. 
Bottom pannel: total luminosity as a function of halo mass with or 
without using the fraction of mass in destroyed subhaloes shown 
in the upper pannel. Results are for the K-band, using the base 
mass-luminosity relation of equation JSj. 

the central galaxies and will therefore not be affected (see 
section 4). Likewise, the only haloes capable of hosting sub- 
haloes large enough to be counted in this range are the most 
massive ones, for which the effect is smallest (see section 3). 
Secondly, in principle, this approach will also depend on the 
exact form of the mass- luminosity relation. However, for the 
small deviations from the base relation we will be consider- 
ing in this paper, the effect on the total luminosity is small, 
since the variation to the base relation will be greatest at 
higher mass, where this effect is smallest. We will therefore, 
for simplicity, use this one result throughout the paper. 



Finally, it needs to be stressed that this is just a very 
simple approximation. The calculated factor is applied to the 
whole subhalo mass fraction as a correction to the normal- 
ization, without taking into account a possible dependence 
on subhalo mass. In particular, the situation with very low 
mass subhaloes is very uncertain in this scheme, since they 
are expected to be very faint, and have therefore very little 
weight in the sum of the total luminosity, while they can 
contribute an important fraction of the mass. Another im- 
portant point is that in principle the luminosity of the galax- 
ies that were contained in the destroyed subhaloes should 
be added to the central galaxy luminosity, since under this 
scheme we are assuming that these are merging. In practice, 
though, the light in these destroyed subhaloes is going to be 
small in comparison with the BCG in this model, since the 
largest fraction of destroyed subhaloes occurs for less mas- 
sive haloes where the BCG is dominant. For simplicity, we 
will here ignore this contribution. 



3 INTRODUCING SCATTER 

3.1 The conditional luminosity function 

In the context of the present paper, we need a more de- 
tailed model than the one described previously. Most im- 
portantly, it needs to include some kind of scatter in the 
mass-luminosity relation. Naturally, we expect that not all 
galaxies in hosts of the same mass will have the same lumi- 
nosity. To capture this, we introduce a dispersion around 
the average relation describe above. We use the condi- 
tional lumi n osity function ( CLF) formalism introduce d by 
lYang et al.1 (120031 ') (see also Ivan den Bosch et"aLll2006l and 
references therein) and by Cooray and collaborators (e.g., 
[Cooray & Milosavljcvic 2005a. ; Coorav 2006 and references 
therein). This consists of replacing a deterministic mass- 
luminosity relation, like the one in equation ([5]), with a dis- 
tribution of luminosity around an average value for any given 
halo mass, (j)cLF{L\M)dL, which represents the probability 
of having a galaxy of luminosity L in a halo of mass M. Note 
that here we are only applying this to the central galaxy in 
any given halo, since that is the important one for the study 
of BCGs; the distribution of satellite galaxies we draw di- 
rectly from the distribution of subhalo masses. 

An important point is that this CLF must, by definition, 
match the observed luminosity function when it is integrated 
over all haloes, i.e. JJ^ (f>cLF{L\M)n{M)dM — 4>{L), where 
n{M) is the halo mass function, </>(!/) the observed luminos- 
ity function and L should only be considered in the range 
where the haloes dominate the number of hosts (i.e., at high 
luminosity, which is precisely the range we are interested in 
when looking at BCGs;otherwise, we would also need to ac- 
count for subhalo contribution). The introduction of scatter 
then leads to a problem with the mass-luminosity relation 
derived from th e coun ting method, however. As noted by 
iTasitsiomi et al.l (|2004l ). the fact that the mass function is 
decreasing with increasing mass causes an effect similar to 
the Malmquist bias: for any given mass bin, more objects 
are scattered into it from lower mass bins than are scattered 
out of it. If we then take our base mass- luminosity relation 
to be the average one in the CLF distribution, because of 
this effect we will end up with a calculated luminosity tunc- 
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tion that greatly overestimates the abundance of very bright 
galaxies when compared to the observed one. 

To get the correct matching to the observed luminosity 
function, it is then necessary to modify the average mass- 
luminosity function we take for the basis of the CLF. This 
is achieved by introducing an additional term, of the form: 



L(M) = Lref{M){l + M/M, 



(8) 



where Lref{M) refers to the base mass-luminosity relation 
of equation ((S)). In practice, a is going to be negative since 
we need to lower the luminosity corresponding to any given 
high-mass halo in order to drive the value of our calculated 
luminosity function down. 

There is one final, potentially important point about 
this issue: once scatter is introduced, care must be taken 
when looking at the calculated average mass-luminosity re- 
lation. In our approach, the average luminosity at fixed mass 
needs to go down, relative to the scatter-less case or, looking 
at it the opposite way, the same average luminosity is ob- 
tained for higher mass haloes. This is due to the fact that, 
when considering the CLF, we are doing the binning by mass 
(or more precisely, taking the conditional variable in the dis- 
tribution to be the mass) . If we had instead binned by lumi- 
nosity, the effect of introducing scatter would have been the 
opposite: the average mass correspoding to a given luminos- 
ity would instead have gone down. This is to be expected 
and is just a statistical effect of the two different ways in 
which the conditional function can be defined. It does how- 
ever mean that care must be taken when comparing results 
of different authors to look at how the binning was done in 
each case. 

In this paper we will consider two different models for 
the CLF of the central galaxy: a simpler model where we 
assume the distribution is lognormal, but where we are left 
with a free parameter in the scatter introduced; and a more 
complicated one based on the distribution of concentrations 
at a given halo mass, which fully motivates the introduc- 
tion of scatter in the CLF without any free parameters. For 
the satellite galaxies we will use the same modified mass- 
luminosity relation as well, since these were central galaxies 
within their own independent haloes prior to merging, so it 
is reasonable to expect the same effects to apply to them. 
From semi-analytical modelling, it has been shown that this 
is a good approximation, although a more careful treatment 
shows a sligh tly different relatio n for sattelites than for cen- 
tral galaxies (|Wang et al.ll200^ . But note that, when doing 
analytical calculations, using the subhalo mass function al- 
ready introduces a form of distribution for the subhaloes as 
well (in that the mass of a given subhalo can be drawn from 
it, see section 5.1 for further discussion). 

3.2 Lognormal model 

The simplest CLF model we consider is to assume it has a 
lognormal form. Th is is similar to what was done p r eviously 
by o ther authors jCoorav fc Milosavlievid l2005al : ICooravl 
I2OO6I ). and such a form seems a good match to the distri- 
bution of stellar m ass obtained in semi-analytic modelling 
I Wang et al.ll200^ ). The problem with this approach is that 
there is no a priori reason to assume any specific value for 
the dispersion. Furthermore, this value is linked to the mod- 
ified mass-luminosity relation of equation (|8} , so it needs to 



be defined in some way in order to determine the latter. 
We do this by determining which value of the dispersion 
leads to an average luminosity as a function of mass which 
best fits observational values. For simplicity, we will consider 
that the value of the dispersion, a, is constant and indepen- 
dent of mass. Since we are only interested in the bright end, 
where we expect central galaxies to dominate, and these 
are known to have only a small scatter in lu minosity (e.g., 
iPostman fc Laueilll995l : iBernardi et al.l [20061 ). this is quite 
likely a good approximation. Semi-analytical modelling also 
shows that scatter in st ellar mass is only a weak function of 
halo mass (|Wang et al.| [2006'l. 

The luminosity L of a galaxy in a host of mass M, is 
then given by a lognormal distribution of the type: 



(ln[L/Lo(Af)]+ai^/2)- 



4>CLF(L\M)dL = —= exp 

V^TVaLNL ^ 

where Lo{M) is some average luminosity for a host of mass 
M (discussed further below), and a is the dispersion in the 
normal logarithm of the luminosity. Note that there is some 
confusion in the literature over the exact form defined for the 
lognormal distribution. First, it is necessary to pay attention 
to whether the distribution is in the natural logarithm or 
base 10 logarithm of the variable; the quoted value of a 
will be different in the two cases for the same distribution. 
Secondly, the way we have defined it in equation ((9}, the 
average luminosity is given by Lo{M). This is due to the 
second term in the denominator of the exponential term, 
which not all authors include; if it is omitted, Lo would not 
be the average luminosity. 

The difficulty with using an approach such as this is that 
we are left with two unknowns we need to determine, the ref- 
erence luminosity function Lo and the dispersion a. The only 
condition we can impose on this distribution is that, when 
integrated over all hosts, the resulting luminosity function 
must match the observed one. Fitting this calculated lumi- 
nosity function to the observed one then allows us to relate 
the two parameters we have when we take Lo — L{M) from 
equation a and Ms, to the scatter a. However, this still 
leaves us with one free parameter which we cannot other- 
wise specify. In order to address this problem, we determine 
which value of the scatter gives us an average luminosity as 
a function of mass that best fits the observed data. Since our 
original mass-luminosity relation was already quite a good 
fit to the data (see paper I), we must necessarily have only a 
small correction to it (i.e., a small value of a), which in turn 
implies a small value for the scatter, which is qualitatively 
in good agreement with observations (see further discussion 
in section 7). 

3.3 Concentration model 

The other model for the CLF of BCGs we consider is based 
on the variation of the concentration of haloes with the same 
mass. The basic idea behind this is that the distribution 
of concentration in same meiss haloes will lead to different 
mass in the inner region of the halo where the galaxy will be 
present; the luminosity of the hosted galaxy will then sim- 
ply be proportional to this mass. In practice, we calculate 
the mass-to-light ratio of this inner region, for the average 
concentration and with an average luminosity given by equa- 
tion ([8]), and then calculate the change in luminosity as the 



dL , (9) 
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concentration changes by assuming that the mass-to-light 
ratio is fixed (observationally, it has been noted that the dy- 
namical mass-to-hg ht ratio of BCGs is almost constant, e.g. 
Ivon der Linden eral,.200& ). This then gives us the BCG lu- 
minosity as a function of both concentration and halo mass. 

Although the model is conceptually simple, the details 
are problematic. The main issue is to determine what ex- 
actly is this inner region and how to calculate its mass. The 
most obvious solution, taking quoted values from the liter- 
ature for BCG radius and its dependence on luminosity, is 
not really satisfactory since these are most often determined 
from isophotal limits and in the case of cD galaxies it would 
be necessary to further consider whether to include the en- 
velopes; also, it is natural to assume that the actual region of 
influence for the dark matter is more extensive than the vis- 
ible galaxy. Other options such as some parameter from the 
dark matter structure, run into the problem of motivating 
what exactly it should be. 

In the end, after checking the results of several different 
possible models, we concluded that the one that has the best 
motivation and also gives the best results is to calculate the 
inner region mass by using a weighting function based on 
the luminosity profile of the BCGs. 

Since we just require it to make our weighting function, 
for simplicity we assume that the luminosity profile of BCGs 
can be universally fit by a Sersic profile: 



I(r) = Aexpi^b^ir/r.y/" ^ 1]) , 



(10) 



where yl is a normalization factor, and r(2n) = 2^{2n,b„), 
where r(2n) is the gamma function and 7(2n, x) the incom- 
plete gamma fu nction; this can be well approximated by 
fe„ « 2n — 0.327 (|Capacciolilll989t ). It is know that there is a 
correlation between the profile parameters n and r^, and also 
between and the galaxy luminosity L, alt hough the cor- 
relati on between n and L is very weak (e.g., ICraham et al.l 
1 19961 ) . For simplicity, we will assume that we can relate both 
parameters to the galaxy luminosity; although in practice 
this is not really true, for our purposes here it is a suffi- 
cient, if rou gh, approximation . Based on results from the 
literature (|Graham et al.lll996l : iLin fc Moh3l2004 l. we use 
the following relations: 



n = 2.86941og(r,) + 2.0661, 
log(re) = 0.95231og(L) - 9.1447, 



(11) 
(12) 



where L is the K-band luminosity; th ese are similar to wha t 
is reported by other authors (e.g., iBernardi et al.l [20061 ). 
Our weighting function is then given not by the actual 
profile, but rather the integration factor for the luminos- 
ity, w(r) — rl{r), and the normalization A chosen so that 
JJ^""" w{r)dr — 1. Finally, the inner region mass is simply 
obtained by integrating the mass density times the weight- 
ing function: 



4:Tvr ■w(r)p{r)dr . 



(13) 



We use the usual NFW profile for the density 
iNavarro. Frenk. fc Wh"it3[r997i ). 



P = 



Pi 



(14) 



where x — r/vs, with Vs — Vvir/cvir and the virial radius is 



r„,> = {3M / An A^ir py^^ and A^ir = 387; pi is normalized 
to give the halo mass at the virial radius. 

For the concentra tion distr ibution, we take th e model 
of lBuUock et al.l l|200ll) (see also lMaccio et al.ll2006l . who get 
similar results). This relates the average concentration of a 
halo of mass M with the scale factor at its collapse, Oc, given 
by: 



a{fM) 



(15) 



where a is, as usual, the variance of the linearly extrapolated 
power spectrum of perturbations, D{a) is the growth factor 
and Sc = 1.673 the linear threshold for collapse; / = 0.001 
is a parameter. The concentration is then given by c„ir = 
fc/ac, with the parameter fc = 3. Finally, the distribution of 
the concentration is given by a lognormal distribution with 
average c„ir and variance (j[log(c^ir)] ~ 0.18. 

The BCG luminosity distribution is then obtained 
from the concentration distribution by 0(L|M) = 
f{c\M)/[dL/dc), where f(c\M) is the concentration distri- 
bution as a function of halo mass. As mentioned, the lumi- 
nosity for any given concentration and halo mass is given 
by 



L(c, M) 



inner 

{M/L)o ' 



(16) 



where {M/L)o is the mass-to-light ratio with the inner mass 
calculated at the average concentration and the luminos- 
ity given by our mass- luminosity relation, equation ([SJ, and 
Minner is giveu by equation (|13p . Finally, we fit our calcu- 
lated luminosity function to the observed one, in order to 
determine the parameters that go into the modified mass- 
luminosity relation (see table 2). 

Note that both iBullock et all l|200ll ') and iMaccio et all 

(|2006t ) find that subhaloes tend to have higher concentra- 
tions than parent haloes of the same mass. Although it goes 
beyond the scope of the present work, it is wortwhile to men- 
tion that in the framework of the model just presented, this 
can possibly lead to slightly different distributions of lumi- 
nosity as function of mass for the subhaloes, although this 
is most likely complicated by the fact that we need to take 
the subhalo properties at accretion rather than at present. 

Table 2 shows the values we obtain for the parameters of 
the modified mass- luminosity relation of equation Q. Fig- 
ure [3] shows examples of the actual distribution we obtain 
for the BCG luminosity, both for the lognormal model and 
for the concentration model. 



4 CENTRAL VS SATELLITE GALAXIES 

As was briefiy mentioned above, the way we build up the 
mass-luminosity relation naturally gives rise to a model for 
clusters, featuring a distinct separation between central and 
satellite galaxies. This comes from the fact that we consider 
that galaxies are hosted by both the parent halo and the 
subhaloes. Since we consider that the same mass-luminosity 
relation applies for both, and the former will be, by defini- 
tion, considerably more massive than the latter, this results 
in there being a central, very luminous galaxy, hosted by 
the parent halo, while fainter satellite galaxies are spread 
throughout in the subhaloes. 
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model o"Ljv i Ms 

concentration N/A -0.08 lO^^'^ 

lognormal 0.265 -0.07 10^^ 

Table 2. Fit parameters for the modified mass-luminosity relation of equation (|8]l, for the concentration and lognormal models. The 
dispersion in the latter is fixed to the value shown, and variable in the former. 



concentration model, M= 
concentration model, IVI= 



lognormal model, M=10^^ h"^ 




Ms, 
Ms, 
M., 



mass and its distribution, without the need to account for 
surviving subhaloes. 

It is in fact possible to derive the different contributions 
to the global luminosity function from the central and satel- 
lite galaxies, by associating them with the halo and subhalo 
distributions, respectively, and then using the CLF formal- 
ism presented in the previous section: 



10' 



Figure 3. Examples of the calculated distribution for the BCG 
luminosity, for both the lognormal and concentration models, and 
for two different values of the halo mass. 



(PCLF 



(17) 



where the i indexes refer to the haloes and subhaloes, respec- 
tively. The derived luminosity functions for central and satel- 
lite galaxies are shown in figure|4] Unsurprisingly, the central 
galaxies completely dominate the overall luminosity function 
at high luminosity, with their numbers becoming comparable 
to the satellites only at low luminosity. This simply reflects 
the trends seen in the halo and subhalo numbers (see pa- 
per I). The expected relative cont ributions of b o th typ es of 
galaxies are still uncertain: while iBenson et al.l (|2003al ) us- 
> ing their semi-analytical m odelling fin d satellite galaxies to 
dominate at the faint end, Cooray fc M ilosavljcvic (20051]) 
using a conditional luminosity function formalism find that 
central galaxies do minate throughout the range (likewise, 
IZheng et al.l (|2005l ) find that central galaxies dominate the 
stellar mass function on any mass scale). 



Of particular importance for the question of whether 
the first brightest galaxies are special, this separation im- 
plies that these galaxies should indeed have a special lumi- 
nosity distribution, independent from that of the remaining 
galaxies in the cluster. This comes from the fact that the dis- 
tribution functions of these two types of galaxies will be dif- 
ferent in origin: the central galaxies one will be determined 
by the halo mass function, while the satellite galaxies one 
will depend on the subhalo mass function. This dichotomy 
is also found in HOD models, for instance when accounting 
for total galaxy o ccupation number fe.g.. l\^ng et al.ll2005l : 
IZheng et al]|200d ): while P(iVlA/), the probability that a 
halo of mass M hosts A'^ galaxies, is Poissonian at high A'^, 
where satellite galaxy numbers dominate, it is significantly 
sub-Poissonian at low A*', indicating that the distribution of 
the central galaxies is much more deterministic. 

This has an important consequence, derived from the 
fact that, at high mass, and therefore also at high luminos- 
ity, the total mass function is dominated by the haloes, not 
subhaloes. This means that, when analysing the luminos- 
ity function of galaxies in clusters, the brightest region will 
be dominated by the central galaxies, which will actually 
be more abundant overall than the brightest of the satel- 
lite galaxies. We then expect that this will cause a feature 
in the cluster galaxy luminosity function at the bright end; 
this point will be examined in further detail below. Another 
consequence is that we expect the luminosity of the central 
galaxy in the cluster to be completely determined by halo 



4.1 Definition of cluster threshold mass 

A necessary first step before continuing this analysis is to 
define precisely what we mean by a "cluster". We will opt 
for a simple choice, following the standard Abell definition 
of rich cluster, namely that it must have upwards of 30 ob- 
jects brighter than ms -\- 2™, where ms is the magnitude of 
the third brightest galaxy in the cluster. It is then possible, 
following our model, to translate this into a minimum mass 
threshold for a halo to host a cluster, as follows. 

The third brightest galaxy will correspond to the second 
most massive subhalo (since the brightest galaxy is hosted 
by the parent halo itself), and the probability of this having 
a mass m2 is then: 

P2{ms,2,Mh) = N{msa\Mh) <N> e^^^^^ , (18) 

where N{ms,2\Mh) is the mass distribution function of the 
subhaloes, equation ([2}, and < N >= N{m'\Mh)dm' 
is the average number of subhaloes more massive than m in 
a parent halo of mass M^- This expression assumes that the 
distribution of subhalo masses is Poissonia n with average 
< >, as expected for the subhaloes (e.g.. iKravtsov et al.l 
12004 since we are looking at cluster sized haloes, < N > 
will be large in this case), and it is simply the product of 
the probability of having a subhalo with mass m2, given 
by the first term on the right hand side, by the Poisson 
probability of having exactly one subhalo more massive than 
m2. Using the fact that d < N > /dm — —N{ms\Mh), it is 




Figure 4. Contribution to the high-end luminosity function of 
central and satellite galaxies. These particular curves are for the 
-K-band, lognormal model, but the results are similar for the con- 
centration model. It is very noticeable that the halo numbers 
dominate in this luminosity range. The relative satellite contri- 
bution to the total luminosity f unction is qualitatively sim ilar to 
what is found by other authors l|van den Bosch et al.1120061) . 

easy to check that this probability is well normalized to 1. 
The average value of the magnitude corresponding to this 
subhalo, ma, can then be calculated from the distribution 
by: 

<m3(Mh)>= / m{ms)P2ims,Mh)dms , (19) 
Jo 

where m{ms) represents the corresponding magnitude as a 
function of the subhalo mass, calculated using the mass- 
luminosity relation from section 2. In this instance, we have 
used the simpler relation of equation ([SI, since it greatly 
simplifies the calculations and using the full CLF results in 
only a slight difference. 

Using this magnitude we can then obtain ma -I- 2™ , and 
then convert this back into a mass threshold, mt{Mh), which 
will be dependent on the parent halo mass. Finally, we can 
find the probability, as a function of M^, that N{mt\Mh) > 
29 (giving more than 30 objects above the magnitude limit, 
when including the central galaxy). Since we are assuming 
that the subhalo distribution is Poissonian, this will simply 
be given by -^('^i /^)' "*^here P(n, /i) is the normal 

Poisson probability with average A* =< N{mt\Mh) >■ This 
gives a smooth transition for the mass of cluster hosting 
haloes, shown in figure O starting around a halo mass of 
Mh ~ lO^''/i~^M0, but which depends on the luminosity 
function considered. 

4.2 Cluster galaxy luminosity function 

Once we have a mass threshold for haloes hosting clusters, 
obtaining the cluster galaxy luminosity function is straight- 
forward: we simply sum up the mass function of haloes above 
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0.8 - 



0.6 - 
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0.4 - 
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Figure 5. Probability that a halo of mass M contains more than 
30 galaxies brighter than ma + 2"* and is considered a rich cluster 
according to the usual Abell definition. Using either the bj or the 
K bands to do the counting results in the two different curves. 



Concentration model 
Lognormal model 
Global LF 




Figure 6. Luminosity function of galaxies in rich clusters, in the 
K-hand. The two curves correspond to the two models for the 
dispersion used, described in the previous section. Also shown, 
for comparison, is the global LF. Only galaxies above the mass 
threshold shown in figure[5]were considered. 

this threshold with the mass function of all the subhaloes 
hosted by them (i.e., use equation ([2}, but further multi- 
plied by a term to refiect the probability that the halo does 
indeed host a rich cluster, given by the result shown in figure 
[5|. Then, we transform this into a luminosity function using 
the CLF. Our result is shown in figure [6] 
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Qualitatively, we obtain a good agreement with 
observed luminosity functions (such as the one of 
Ide Propris et"IDl2003l . although in this particular case a di- 
rect comparison is difficult because these results are in a 
different band; redoing our analysis in the same band pro- 
duces a good match), particularly in the lower luminosity 
range. At the bright end, there is some disagreement caused 
by a particular feature of our result, a bump in the lumi- 
nosity function at the bright end. ft is simple to understand 
that this bump is caused by the central galaxies. The dis- 
crepancy in numbers between central and satellite galaxies 
comes from the fact that, at high luminosity, the contribu- 
tion from parent haloes to the total luminosity distribution 
(shown in figure [Jjl completely dominates over the subhalo 
one. This is a refiection of the fact that haloes are much more 
abundant at the high mass end than subhaloes (see paper 
I). In fact, it can be seen from the figure that these central 
galaxies essentially correspond to the high luminosity end 
of the global luminosity function. This is hardly surprising, 
since we can expect the most luminous galaxies to lie at the 
centre of the most massive clusters. 

Such a feature is thus a natural consequence of the 
model: very luminous galaxies will predominantely be cen- 
tral galaxies of high mass haloes, which will therefore domi- 
nate in number over satellite galaxies of the same luminosity; 
at the same time, since we introduce a lower mass limit to 
rich cluster hosting haloes, the faint end contribution to the 
luminosity function of galaxies in such clusters will come en- 
tirely from satellites. It is important to note that this is not 
a particularity of this specific model: any model associat- 
ing central galaxies with parent haloes and satellite galaxies 
with subhaloes will show a similar feature, due to the dis- 
crepancy in numbers between the two at high mass (though 
it may also require that this be associated with high lumi- 
nosity in both cases; or, more particularly, that the same 
mass-luminosity relation is used for both haloes and sub- 
haloes, as is the case in the model used here). 

A note of caution comes from the fact that the shape of 
the bump depends on the dispersion in the CLF: the smaller 
it is, the sharper the bump will be. Furthermore, the actual 
shape of the bump is determined by the cluster definition 
being used, through the cluster threshold mass discussed in 
the previous section. This cutoff mass is responsible for the 
decreasing values of the cluster galaxy luminosity function 
on the left side of the bump; a lower threshold mass would 
result in a wider bump. Taking this threshold to lower and 
lower values (beyond the range where it would be reasonable 
to assume the presence of clusters) results in the progressive 
disappearance of the bump as we naturally regain the overall 
global luminosity function. It should be stressed, however, 
that the presence of a bump is a fundamental prediction of 
the model, independent of the precise cluster definition being 
used, since it is a direct consequence of the discrepancy in 
numbers between haloes and subhaloes at high mass. 

This kind of feature is also present in some recent work 
dealing with HOD models and t he centra l/satellite galax y 
separation. In lZheng et all l|2005l 'l (see also lZhu et al.ll200d ). 
the authors use a semi-analytical model of galaxy formation 
to obtain the conditional galaxy baryonic mass function. For 
high mass haloes in the range we are considering here, they 
also obtain a high mass bump in this function caused by the 
central galaxy. They show that the baryonic mass function 



can be described by combining a Schechter function repre- 
senting the satellite galaxies contribution with a high mass 
gaussian due to the central galaxy. Likewise, IZehavi et al.l 
( ,2005. ) build up HOD models from SDSS results, and ana- 
lyze the central/satellite galaxy split; from this, they build 
conditional luminosity functions, and show that their results 
imply that the central galaxies lie far above a Schechter func- 
tion extrapolation of the satellite population. The observa- 
tional work also hints at similar features: for example, it is 
known that cD galaxies are brighter than what is given by 
the bright end of the cluster galaxy lu minosity function (e.g., 
iBinggeli. Sandage fc TammanrJ 19881 ) . This is also present in 
studies of the lu minosity function of galaxies in clusters (e.g., 
et al.l I2OO4I ) . All this once again reinforces the notion 
that central galax;ies form a special distribution, essentially 
separate from the satellite galaxy one. 



5 BUILDING THE BCG LUMINOSITY 
DISTRIBUTION 

As discussed above, the non-parametric model used natu- 
rally builds a picture of galaxy clusters. This translates itself 
into a procedure to build up the total luminosity distribu- 
tion of galaxies within a halo of a given mass. This will be 
composed of two steps, one dealing with the satellite galaxies 
in the subhaloes, another with the central galaxy. 

5.1 Satellites 

In this step, we need to sum up the total luminosity in the 
satellite galaxies contained in the halo. We start by taking 
the total number of subhaloes in a given halo, as calculated 
from the SHMF (that is, the occupation number; see paper 
I) , as an average number for a parent halo of this mass, 
taking into account the effect of destroyed subhaloes as in- 
troduced in section 2.1. In this step it is necessary to specify 
a minimum mass: we take a low enough value to ensure that 
we account for all subhaloes massive enough to give a notice- 
able contribution to the total luminosity of the halo. We then 
assume that the total number of subh aloes follows a Poisso n 
distribution (as discussed above; e.g.. iKravtsov et al.ll20oj '). 

For each subhalo, up to a total as calculated from the 
Poisson distribution, we determine a mass, by assuming the 
subhaloes follow a random distribution given by the subhalo 
mass function ([2)|. Then we convert this to the luminosity 
of the hosted galaxy using the mass luminosity relation. 

Finally, once we have the total number of subhaloes (as 
determined initially from the Poisson distribution), we can 
sum all of their calculated luminosities to obtain the total 
in satellite galaxies. 

At the same time, the average luminosity of the bright- 
est satellite galaxy can be calculated in a more direct fash- 
ion. Using the subhalo mass distribution, we can get the 
probability distribution of the mass of the most massive sub- 
halo. Analogously to what was done in the previous section 
for the second most massive subhalo (see equation llSp. this 
will be given by 

Pi(mi,Mh) =iV(mijMft)e-<'^>, (20) 

where < TV >= N(m'\Mh)dm' as before. Used together 
with the mass luminosity relation, the average luminosity of 
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the galaxy hosted in the most massive subhalo (and there- 
fore the most luminous of the satellites) is simply given by 

< Li > (Mh) = /J^L(mi)Pi(mi,Mh)dmi. 

5.2 Central galaxy 

The way we have built up the CLF gives us a natural way of 
obtaining the distribution of the luminosity of first brightest 
galaxies with cluster luminosity, since we are already intro- 
ducing a distribution with mass. We use the results of both 
the lognormal and concentration models for comparison. 

The total luminosity is obtained by summing over the 
BCG and satellite contributions, and taking into account 
the effect of the destroyed subhaloes from section 2.1. The 
average total luminosity at any given mass is simply the sum 
of the average luminosities of the BCG and satellites, and 
is, by construction, equal to the one shown in figure O 

Finally, we can also obtain the global distribution of 
BCG luminosity over all clusters. To do this, we use the 
cluster threshold mass, as calculated in the previous section, 
and simply integrate the conditional distribution we have 
multiplied by the halo mass function; 



/(Li)=/ f{L^\A4)n{M)p{M)dM , 



(21) 



where n(M) is the halo mass function, given by ((T|, and 
p{M) is the probability that a halo of mass Af hosts a rich 
cluster, as given in figure [5] 



6 OTHER MODELS FOR BCGS 

In this section, we introduce two other simple models to 
complement the one presented above in order to allow better 
comparisons with the observational results. The first, and, 
a priori, best motivated is based on the simple assumption 
that the BCGs are merely the extreme values of the unique 
distribution that applies to all cluster galaxies. We do this 
based on a regular cluster galaxy luminosity function, and 
build the BCG distribution directly from it. The second is 
based on assuming some form of cannibalism. We use a sim- 
ple approach to model this mechanism, that of merging the 
brightest galaxy with one other, where both are taken from 
the universal distribution in the first case we consider. 



6.1 Extremes of a general distribution 

The simplest assumption possible when studying the dis- 
tribution of the galaxies in a cluster is that they are all 
drawn from the same statistical distribution (for example, 
a Schechter function for luminosity). The galaxies in a clus- 
ter would then simply be a random sample drawn from this 
distribution, with the brightest galaxy simply the extreme 
value of this sample. This approach has been studi ed in the 
literature before (e.g.. ITremaine fc Richston3ll977^ . In par- 
ticular, it has been shown that this approach leads to a result 
from extreme value theory, the Gumbel distribution, for the 
overall dist ribution of the magnitudes of the first bright- 
est g alaxy l|Bhavsar fc Barrow! 1 19851 : iBernstein fc Bhavsarl 



(22) 



bOOd ). This is given by 



with M" = Mg + ^^^p-, where Mq is the mean of the mag- 
nitude values and a is a measure of the steepness of fall of 
the parent distribution. 

In the present paper, we take a slightly different ap- 
proach, in order to match that which we will take for 
the other models. This model and subsquent calculations 
are similar to the ones presented in ITremaine fc Richstond 
(1977 ). although some details, like the luminosity function 
used, will be different. We begin by assuming that the par- 
ent distribution of the luminosity of the galaxies in a cluster 
is given by a Schechter function, equation ([Sjl (for simplicity, 
we use the same values for M« and the faint end slope, a, 
as the global luminosity function). 

The only dependence on the actual cluster considered 
comes in the normalization, which we set so that the total 
luminosity in all the galaxies equals the cluster luminosity, 



.(ic) = 



r[2 ~a]L, 



(23) 



We then take the value of the distribution of the first 
brightest galaxy at a given luminosity to be the probabil- 
ity that there are no galaxies brighter than that luminosity, 
times the probability that there is a galaxy at that luminos- 
ity. The latter is simply given by (|3]), while for the former 
we take a Poisson fluctuation around the average number 
of galaxies obtained by integrating the parent distribution, 
((3|. Thus, the luminosity distribution for the brightest clus- 
ter galaxy is given by: 



/i(L,Le) = <^(L,Le)e L 



,p{L,La)dL 



(24) 



where the integral in the exponential can be resolved to 
(j>{L)dL = (j>,{Lc)T[l - a,L/L,]. Using the same prmci- 
ples as discussed in section 14.11 above, it is simple to show 
that this probability distribution is adequately normalized 
to 1. 

Similarly, the probability for the luminosity of the sec- 
ond brightest galaxy is given by the product of the probabil- 
ity of having a galaxy at a given luminosity by the probabil- 
ity of there being a single galaxy brighter than that luminos- 
ity. In general, and again taking Poisson fluctuations around 
the average number, the distribution of the n-th brightest 
galaxy will be given by: 



(l,{L')dL')"e~JL 



{L')dL' 



/n!.(25) 



Likewise, the joint probability of having the flrst bright- 
est galaxy at luminosity Li and the second at L2 (given by 
the product of the probability of a galaxy at L2 , another at 
Li, none in between and none above Li) can also be derived: 



fi2(Li,L2,Lc) = (t){L-i,Lc)(j}{L2,Lc)t 



4,(L)dL 



(26) 



f{M) 



a(M-M*)-e°<"-*='*) 



Once we have these distributions, it is then easy to 
obtain the various statistics we are interested in: the av- 
erage magnitude of the first and second brightest galaxies, 
< mi > and < 1x12 >, the dispersion in first brightest 
galaxy magnitude, a\, and the average magnitude differ- 
ence between these two < A12 >=< mi — m2 >. Some 
caution is necessary with the last one, since the fact that 
L\ and L2 are not independent variables would necessitate 
the use of the joint distribution; however, it turns out that 



12 A. Vale and J. P. striker 



the integrals over this joint distribution resolve to two dif- 
ferent integrals over the two separate distributions, so that 
< Ai2 > = < mi > — < m2 > as calculated for each sepa- 
rate, individual distribution. 



6.2 Cannibalism 

To illustrate the effect of galactic cannibalism, we consider 
here two simple models, starting with a common distribu- 
tion like the one discussed in the previous section, and then 
merging the brightest galaxy with one of the others. 

In general, the distribution of the new BCG will have to 
be taken from the joint distribution of the previous first and 
n-th brightest galaxies. It is important to note that these 
variables are not independent, and as such when calculating 
the average of the sum it will not, in principle, be possible 
to separate it into integrals over the two individual distri- 
butions (except in the particular case when n = 2, as noted 
above) . 



6.2.1 L1+L2 

The simplest possible model to consider when taking ac- 
count of cannibalism is to merge the two brightest galaxies 
from a common distribution. Thus, the luminosity of the 
brightest galaxy is now given by the sum of the luminoisi- 
ties of the previous two brightest galaxies, with a distribu- 
tion given by the joint distribution of equation (|26|l . The 
second brightest galaxy is now the old third brightest, with 
a distribution given by equation (I25|l . with n — 3. 

The remaining calculations follow the same pattern as 
discussed in the previous case. It is to be expected that, due 
to the fact that the new first brightest luminosity is the re- 
sult of the sum of two previous variables, the dispersion in 
first brightest luminosity will be slightly lower than before. 
Likewise, it is obvious that the difference in magnitude be- 
tween first and second brightest galaxies will now be much 
larger. 



6.2.2 Li+L„ 

A slightly more realistic toy model to illustrate galactic can- 
nibalism is to proceed similarly as described above, but in- 
stead of merging the first brightest galaxy with the second, 
to take some weighting function to reflect the probability 
that the merger will occur with any one other of the galax- 
ies in the cluster. 

We take the probability of the merger occuring with the 
n-th brightest galaxy to be proportional to its average lumi- 
nosity, L„; this gives greater weight to the first few bright- 
est galaxies, but since the average luminosity decreases only 
slowly with n, the probability is split over a wide range. We 
consider a possible merger down to the 30th brightest galaxy 
in the cluster, for which the merger probability given by this 
prescription will be below 1%. 

The total probability distribution of the new BCG lu- 
minosity is obtained from the joint probability distribution 
of the different weighted pairs. Since the variables are not 
independent and therefore this joint distribution cannot be 
split into a product of terms each dependent only on a single 



variable, to calculate < mi >, < m2 > and cri it is neces- 
sary to solve complicated integrations. We turn instead to 
a Monte Carlo method, building up the distribution of the 
new BCG luminosity by randomly generating merger pairs 
and the luminosity of their components, and then summing 
them. The new second brightest galaxy will be either the 
old one, or the old third brightest, depending on whether 
the merger occurred with the former or not. 



7 RESULTS 

We start by showing the results for average magnitude of 
first and second brightest galaxies as a function of total clus- 
ter luminosity, for each of the models, in figures [7] and [S] The 
first noticeable thing is that the curves for both the concen- 
tration and lognormal models are very similar, which is not 
too surprising given the similar mass-luminosity relations 
obtained for each of them (see table 2), even though they 
were built in independent ways. This is in fact a success for 
the concentration model, since the lognormal model is by 
construction made to provide the best fit to the observed 
BCG magnitude, while the concentration model is not. 

Comparing the different sets of curves, it is easy to see 
the differences between the models. The statistical distri- 
bution has the lowest average BCG luminosity, which is 
considerably higher for the other models. This comes from 
the fact that all other models take the BCG distribution to 
be special, like an additional distribution added on to the 
base Schechter distribution of the satellite galaxies. Looking 
at the average magnitude of the second brightest galaxy, 
the most obvious factor is that the one from the more ex- 
treme cannibalism model is much lower than the others; once 
again, this is unsurprising since this is essentially the third 
brightest galaxy in the statistical model. Likewise, the H-n 
cannibalism model gives essentially the same result as the 
statistical one, since the second brightest galaxy is the same 
in both in most cases. Both of our models give considerably 
brighter values for the second brightest galaxy. This proba- 
bly indicates that the luminosity function we used to gen- 
erate the statistical and cannibalism models does not have 
enough bright galaxies (i.e., L* is too low), which is not very 
surprising since we are using the parameters of the global 
one. On the other hand, if this were to be the case then it is 
quite probable that the cannibalism models would then give 
BCGs which are too bright. 

In any case, this problem should be less of an issue 
when looking at the magnitude gap A12, shown in figure 
[9l The values for the cannibalism 1-1-2 model are obviously 
much too high to match the observed ones: it is clear that 
merging the two brightest galaxies leaves too big a gap to 
the next brightest. The values for the statistical model are 
too low, in this case probably indicating that it is the BCG 
which is too faint. The values for the other, 1-l-n, cannibalism 
model look rather better, while both of our models show 
good agreement with the observations. But note the size of 
the errorbars in the observational data: the scatter in the 
observed values is quite la rge (see also .Loh fc Strauss 20061: 
Ivan den Bosch et al.l [200^ ) . Since the scatter in the BCG 
magnitude is small, most of the scatter here in A12 is likely 
coming from the second brightest galaxy. The shape seen 
in the curves of figure [9] is unsurprising: the fact that A12 
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Figure 7. Average magnitudes, in the _R'-band, of tlie first bright- 
est galaxy, as a function of the total cluster luminosity, as calcu- 
lated for the different models: the ones based on the CLF for- 
malism introduced, the statistical model and two forms of canni- 
balism: the extreme Li + L2, and the softer Li -|- Ln- The data 
points shown are binned values of cluster galaxy data supplied by 
Yen-Ting Lin (2006, private communication). 
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Figure 9. Values for the average magnitude difference between 
first and second brightest galaxies, A12. The data points come 
are again binned values from the catalogue supplied by Yen-Ting 
Lin. The large errorbars reflect the fact that the scatter in the 
values of A12 in these clusters is very large. For comparison, 
another observation al value for A12, but in the i?-b and, is of 
^ 0.55 magnitudes l|Schneider. Gunn. fc Hoessellll983l ). The val- 



ues we obtain are a lso 
iMilosavlievic et al.l 



ual itatively consistent w ith the results of 
200i ): lLoh fc Strausd l|2006h . 
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Figure 8. Same as figure [7] but for the second brightest galaxy 
in the cluster. 



is increasing as the total luminosity goes down is a natural 
consequence of the decreasing number of satellites at this 
lower end. In fact, these correspond to haloes of only a few 
times 10^'^/i~^Mq , and therefore many of these systems will 
not actually even be clusters (cf figure [S}. 

Figure [TO] shows the calculated dispersion in the magni- 
tude of the first brightest galaxy. Binning the Lin observa- 
tional data results in values of cri of 0.3 to 0.4 magnitudes. A 
direct comparison of this value with the model results shows 
that the values we obtain for our model are slightly too low, 
while the values for the statistical and cannibalism mod- 
els look reasonable. There is, however, an additional point 
to bear in mind: these observational values are obtained by 
binning over a certain cluster luminosity range. Part of this 
observed scatter then simply comes from the fact that the 
average magnitude is changing as a function of this. When 
taking this into account, the values we get from our model 
are in fact in good agreement with the observed ones. Also, 
using Bayes' theorem, it is possible to derive values for the 
dispersion in the mass of the hosting halo for a given galaxy 
luminosity. Doing so we obtain values which are in qualita- 
tive a greement with the ones found by Ivan den Bosch et al.l 
l|2006l ). although a direct comparison is complicated by the 
fact that they use 6j-band instead of isT-band for the galaxy 
luminosity. 
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Figure 10. Dispersion in the magnitude of the first brightest 
galaxy. The data set used to plot the data points in the previous 
figures gives a value of between 0.3 and 0.4 magnitudes for the 
statistical error in each cluster luminosity bin. Another observa- 
tionally measured dis per sion in first brightest magnitude, in the 
S-band, is tri ~ 0.24 jPostma n & Lauer 1995). Using a CLF ap- 
proach similar to ours. lCooravl (2006h obtains a value of cri =0.17 
in the r-band. 



8 CONCLUSION 

In the present paper we focus on the implications of the mass 
luminosity relation first introduced in paper I on the prop- 
erties of clusters, and more in particular on whether BCGs 
are statistical or special in nature. We expand the model 
of paper I by introducing scatter into the relation, building 
a CLF based on two different models: a simple lognormal 
shape or a model based on the distribution of concentra- 
tions in haloes of a given mass. We also introduce a simple 
model to evaluate the mass fraction in subhaloes which have 
been completely disrupted since they were accreted. 

We have argued that this model naturally gives a sep- 
aration between central and satellite galaxies in a cluster, 
with the former having a distinct distribution based on the 
halo mass function. We have shown that this leads to a 
characteristic bump in the cluster galaxy luminosity func- 
tion, qualitatively similar to that seen in some observational 
work and some semi-analytical HOD based models. This is 
caused by the fact that, at any given high mass, haloes are 
considerably more abundant than subhaloes, coupled with 
the fact that, in any single system, central galaxies will be 
considerably brighter than the satellites (since the subhaloes 
are much less massive than the parent halo). Together with 
this, the faint end is completely determined by the satellite 
galaxies in the subhaloes, since we put in a minimum mass 
threshold for the haloes we consider host clusters. 

Finally, to look at the question of the nature of BCGs, 
we study some statistical indicators that may provide a clue 
to this problem, namely the ratio r between the magnitude 



difference between first and second brightest galaxies in a 
cluster, Ai2, and the dispersion of the former, ai. We also 
introduce two simple models to account for two different 
possibilities that are usually considered: the statistical hy- 
pothesis, that is, that all galaxies in a cluster, including the 
BCG, are drawn from the same distribution; and galactic 
cannibalism, where the BCG grows by merging with other 
galaxies in the cluster. As is already known, any model of 
the former gives a value for r which is too small compared to 
what is observed. At the same time, we show that the sim- 
plest case of the latter, that of merging the two brightest 
galaxies from a common distribution, gives a value of A12 
which is far too large. 

From the results we obtain, it is possible to draw some 
answers to the issue discussed in the introduction about the 
nature of BCGs. The statistical hypothesis, which assumes 
that BCGs are drawn from the same universal distribution, 
can be ruled out. It gives values of ai which are too big and 
A12 which are too small. 

On the other hand, the simplistic model we analyse for 
galactic cannibalism looks to be far too extreme. Mainly, 
it gives values of A12 which are far too large compared to 
what is observed. This model mimics the scenario of two 
similar sized clusters merging, with a final distribution of 
galaxies which can be well fit by a single distribution, but 
where the BCG is then built up by merging the two BCGs 
of the original clusters. From our results, we expect that 
if such a scenario does occur, a merging of the brightest 
galaxies is excluded, as it leaves a too bright BCG and too 
large a ga p to the second br ightest galaxy (but see the dis- 
cussion m iLin fc Moh3l2004D . A more general cannibalistic 
scenario, where the BCG is built up by merging with one 
other galaxy, is however not excluded, nor is the possibility 
of minor mergers, where one of the merging systems is much 
smaller than the other such that its brightest galaxy will not 
be the second brightest galaxy in the resulting cluster. It is 
worth noting that this fits in well wit h the results of re- 
cent s emi-analytic simulations done by Ide Lucia fc BlaizotI 
(|2006[ ). who find that BCG growth occurs fast enough that 
major mergers are relatively rare, and that most of the later 
growth is through minor mergers. 

Our model gives results which can be regarded as 
halfway between the other two: the BCG is in fact the prod- 
uct of a special distribution and is considerably brighter than 
what would result from a single distribution, but the second 
brightest galaxy is not as faint as in the cannibalistic sce- 
nario. The question remains, however, of what is the BCG 
formation scenario expected in this case. A satisfactory an- 
swer, in the framework of the way the mass luminosity re- 
lation is built, would require going back to the simulations 
and following the behaviour of the halo and its subhaloes. 

There is an important assumption which has been im- 
plicit throughout this paper: that both the central and satel- 
lite galaxies follow the same mass luminosity relation. This 
may indeed appear to be contradictory with the possibility 
that the central galaxies are special, since it may seem to 
imply that they are formed through the same processes. On 
the other hand, the model is based on structure build up 
through the merger of dark matter haloes, and we assume 
that the satellite galaxies were formed in their own inde- 
pendent haloes, prior to being accreted into their present 
parent halo. This original halo would be the one that deter- 
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mines their properties, since the galaxy stops accreting gas 
or undergoing mergers of its own once its halo merges into 
the parent system and it becomes a satellite (and hence the 
need for some mass loss prescription). Therefore, it is to be 
expected that at least in some ceises, they would have been 
a BCG in their own system themselves, and therefore it may 
not be unreasonable to assign them the same mass luminos- 
ity relation. Still, this leaves out the very important factor 
that the formation epoch may well be different, together 
with subsquent growth of the BCG in the parent system. At 
the same time, halo numbers completely dominate at high 
mass (and therefore it is to be expected the same is true 
of central galaxy numbers at high luminosity), and conse- 
quently the total average mass luminosity relation should 
be pretty much the same as calculated. 

We should stress the fact that our concentration models 
gives quite good results, since unlike the lognormal model, it 
does not involve any fitting of parameters to BCG results. In 
fact, it is quite interesting that it gives values for the disper- 
sion in BCG magnitude so close to the ones from the lognor- 
mal model, when the latter was set to the value that results 
in the best fit to the observed BCG magnitudes. In light of 
the recent discussion about the effects of additional param- 
eters, such as environ mental density, halo formation time 
or co ncentration (e.g., IWechsler et al. I I2OO6I : iBerlind et al.l 
l2006t ). on the clustering of halos and subsequently on the 
HOD, this seems to indicate that taking the halo mass as 
the primary determinant of the hosted galaxy luminosity, 
and then taking concentration as a secondary variable which 
determines the scatter around the average value is a good 
way to proceed. 
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